911 research outputs found

    Complete genome sequences of three novel human papillomavirus types, 175, 178, and 180.

    Get PDF
    We report the characterization of three novel human papillomavirus (HPV) types of the genus Gammapapillomavirus. HPV175 and HPV180 were isolated from a condyloma. HPV178 was isolated from healthy skin adjacent to an actinic keratosis

    Benchmarking the next generation of homology inference tools

    Get PDF
    Motivation: Over the last decades, vast numbers of sequences were deposited in public databases. Bioinformatics tools allow homology and consequently functional inference for these sequences. New profile-based homology search tools have been introduced, allowing reliable detection of remote homologs, but have not been systematically benchmarked. To provide such a comparison, which can guide bioinformatics workflows, we extend and apply our previously developed benchmark approach to evaluate the 'next generation' of profile-based approaches, including CS-BLAST, HHSEARCH and PHMMER, in comparison with the non-profile based search tools NCBI-BLAST, USEARCH, UBLAST and FASTA. Method: We generated challenging benchmark datasets based on protein domain architectures within either the PFAM + Clan, SCOP/Superfamily or CATH/Gene3D domain definition schemes. From each dataset, homologous and non-homologous protein pairs were aligned using each tool, and standard performance metrics calculated. We further measured congruence of domain architecture assignments in the three domain databases. Results: CSBLAST and PHMMER had overall highest accuracy. FASTA, UBLAST and USEARCH showed large trade-offs of accuracy for speed optimization. Conclusion: Profile methods are superior at inferring remote homologs but the difference in accuracy between methods is relatively small. PHMMER and CSBLAST stand out with the highest accuracy, yet still at a reasonable computational cost. Additionally, we show that less than 0.1% of Swiss-Prot protein pairs considered homologous by one database are considered non-homologous by another, implying that these classifications represent equivalent underlying biological phenomena, differing mostly in coverage and granularity. Availability and Implementation: Benchmark datasets and all scripts are placed at (http://sonnhammer.org/download/Homology_benchmark). Contact: [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online

    Characterization of human papillomavirus subtype 72b.

    Get PDF
    We report the characterization of human papillomavirus (HPV) subtype 72b of the genus Alphapapillomavirus isolated from an oral rinse sample of a healthy woman. The HPV72b L1 open reading frame (ORF) was 90.2% identical to that of HPV72, indicating a subtype close to the border of a novel HPV type

    Domain tree-based analysis of protein architecture evolution

    Get PDF
    Understanding the dynamics behind domain architecture evolution is of great importance to unravel the functions of proteins. Complex architectures have been created throughout evolution by rearrangement and duplication events. An interesting question is how many times a particular architecture has been created, a form of convergent evolution or domain architecture reinvention. Previous studies have approached this issue by comparing architectures found in different species. We wanted to achieve a finer-grained analysis by reconstructing protein architectures on complete domain trees. The prevalence of domain architecture reinvention in 96 genomes was investigated with a novel domain tree-based method that uses maximum parsimony for inferring ancestral protein architectures. Domain architectures were taken from Pfam. To ensure robustness, we applied the method to bootstrap trees and only considered results with strong statistical support. We detected multiple origins for 12.4% of the scored architectures. In a much smaller data set, the subset of completely domain-assigned proteins, the figure was 5.6%. These results indicate that domain architecture reinvention is a much more common phenomenon than previously thought. We also determined which domains are most frequent in multiply created architectures and assessed whether specific functions could be attributed to them. However, no strong functional bias was found in architectures with multiple origins

    RTK: efficient rarefaction analysis of large datasets

    Get PDF
    Motivation: The rapidly expanding microbiomics field is generating increasingly larger datasets, characterizing the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, software currently available is inadequate for large datasets and some computationally intensive tasks, like rarefaction and associated analysis. Results: Here we present a software package for rarefaction analysis of large count matrices, as well as estimation and visualization of diversity, richness and evenness. Our software is designed for ease of use, operating at least 7x faster than existing solutions, despite requiring 10x less memory. Availability and implementation: C ++ and R source code (GPL v.2) as well as binaries are available from https://github.com/hildebra/Rarefaction and from CRAN (https://cran.r-project.org/). Contact: [email protected], [email protected]

    Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper

    Get PDF
    Orthology assignment is ideally suited for functional inference. However, because predicting orthology is computationally intensive at large scale, and most pipelines relatively inaccessible, less precise homology-based functional transfer is still the default for (meta-)genome annotation. We therefore developed eggNOG-mapper, a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed clusters and phylogenies from eggNOG. To validate our method, we benchmarked Gene Ontology predictions against two widely used homology-based approaches: BLAST and InterProScan. Compared to BLAST, eggNOG-mapper reduced by 7% the rate of false positive assignments, and increased by 19% the ratio of curated terms recovered over all terms assigned per protein. Compared to InterProScan, eggNOG-mapper achieved similar proteome coverage and precision, while predicting on average 32 more terms per protein and increasing by 26% the rate of curated terms recovered over total term assignments per protein. Through strict orthology assignments, eggNOG-mapper further renders more specific annotations than possible from domain similarity only (e.g. predicting gene family names). eggNOG-mapper runs ~15x than BLAST and at least 2.5x faster than InterProScan. The tool is available standalone or as an online service at http://eggnog-mapper.embl.de

    Cross-cultural adaptation and validation of the “spinal cord injury-falls concern scale” in the Italian population

    Get PDF
    Study design: Psychometrics study. Objective: The objective of this study was to develop an Italian version of the Spinal Cord Injury-Falls Concern Scale (SCI-FCS) and examine its reliability and validity. Setting: Multicenter study in spinal units in Northern and Southern Italy. The scale also was administered to non-hospitalized outpatient clinic patients. Methods: The original scale was translated from English to Italian using the “Translation and Cultural Adaptation of Patient-Reported Outcomes Measures” guidelines. The reliability and validity of the culturally adapted scale were assessed following the “Consensus-Based Standards for the Selection of Health Status Measurement Instruments” checklist. The SCI-FCS-I internal consistency, inter-rater, and intra-rater reliability were examined using Cronbach’s alpha coefficient and the intraclass correlation coefficient, respectively. Concurrent validity was evaluated using Pearson’s correlation coefficient with the Italian version of the short form of the Wheelchair Use Confidence Scale for Manual Wheelchair Users (WheelCon-M-I-short form). Results: The Italian version of the SCI-FCS-I was administered to 124 participants from 1 June to 30 September 2017. The mean ± SD of the SCI-FCS-I score was 16.73 ± 5.88. All SCI-FCS items were either identical or similar in meaning to the original version’s items. Cronbach’s α was 0.827 (p < 0.01), the inter-rater reliability was 0.972 (p < 0.01), and the intra-rater reliability was 0.973 (p < 0.01). Pearson’s correlation coefficient of the SCI-FCS-I scores with the WheelCon-M-I-short form was 0.56 (p < 0.01). Conclusions: The SCI-FCS-I was found to be reliable and a valid outcome measure for assessing manual wheelchair concerns about falling in the Italian population

    proGenomes: a resource for consistent functional and taxonomic annotations of prokaryotic genomes

    Get PDF
    The availability of microbial genomes has opened many new avenues of research within microbiology. This has been driven primarily by comparative genomics approaches, which rely on accurate and consistent characterization of genomic sequences. It is nevertheless difficult to obtain consistent taxonomic and integrated functional annotations for defined prokaryotic clades. Thus, we developed proGenomes, a resource that provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to 5306 consistent and accurate taxonomic species clusters based on previously established methodology. proGenomes also contains functional information for almost 80 million protein-coding genes, including a comprehensive set of general annotations and more focused annotations for carbohydrate-active enzymes and antibiotic resistance genes. Additionally, broad habitat information is provided for many genomes. All genomes and associated information can be downloaded by user-selected clade or multiple habitat-specific sets of representative genomes. We expect that the availability of high-quality genomes with comprehensive functional annotations will promote advances in clinical microbial genomics, functional evolution and other subfields of microbiology. proGenomes is available at http://progenomes.embl.de

    Gut microbiota differs between children with inflammatory bowel disease and healthy siblings in taxonomic and functional composition: a metagenomic analysis

    Get PDF
    Current treatment for pediatric inflammatory bowel disease (IBD) patients is often ineffective, with serious side effects. Manipulating the gut microbiota via fecal microbiota transplantation (FMT) is an emerging treatment approach but remains controversial. We aimed to assess the composition of the fecal microbiome through a comparison of pediatric IBD patients to their healthy siblings, evaluating risks and prospects for FMT in this setting. A case-control (sibling) study was conducted analyzing fecal samples of six children with Crohn's disease (CD), six children with ulcerative colitis (UC) and 12 healthy siblings by metagenomic sequencing. In addition, lifetime antibiotic intake was retrospectively determined. Species richness and diversity were significantly reduced in UC patients compared with control [Mann-Whitney U-test false discovery rate (MWU FDR) = 0.011]. In UC, bacteria positively influencing gut homeostasis, e.g., Eubacterium rectale and Faecalibacterium prausnitzii, were significantly reduced in abundance (MWU FDR = 0.05). Known pathobionts like Escherichia coli were enriched in UC patients (MWU FDR = 0.084). Moreover, E. coli abundance correlated positively with that of several virulence genes (SCC > 0.65, FDR < 0.1). A shift toward antibiotic-resistant taxa in both IBD groups distinguished them from controls [MWU Benjamini-Hochberg-Yekutieli procedure (BY) FDR = 0.062 in UC, MWU BY FDR = 0.019 in CD). The collected results confirm a microbial dysbiosis in pediatric UC, and to a lesser extent in CD patients, replicating associations found previously using different methods. Taken together, these observations suggest microbiotal remodeling therapy from family donors, at least for children with UC, as a viable option.NEW & NOTEWORTHY In this sibling study, prior reports of microbial dysbiosis in IBD patients from 16S rRNA sequencing was verified using deep shotgun sequencing and augmented with insights into the abundance of bacterial virulence genes and bacterial antibiotic resistance determinants, seen against the background of data on the specific antibiotic intake of each of the study participants. The observed dysbiosis, which distinguishes patients from siblings, highlights such siblings as potential donors for microbiotal remodeling therapy in IBD
    • …
    corecore